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Abstract. Given a collection "if of subsets of a finite set X, let |J "if = Ugg-^S'. 
Philip Hall's celebrated theorem [2j concerning 'systems of distinct representa- 
tives' tells us that for any collection of subsets of X there exists an injective 
(i.e. one-to-one) function / : — > Jf with f{S) G S for all S ii and and 
only if satisfies the property that for all non-empty subsets of we have 
lU'i^'l > \'^'\- Here we show that if the condition lU*^'! > l^'l is replaced 
by the stronger condition | U*^'! ^ T'^'l + 2, then we obtain a characterization 
of this condition for a collection of 3-element subsets of X in terms of the 
existence of an injective function from to the vertices of a tree whose vertex 
set includes X and that satisfies a certain median condition. We then describe 
an extension of this result to collections of arbitrary-cardinality subsets of X. 



1. First result 



Given a tree T = [V, E) and a subset S* of F of size 3, say S — {x, y, z}, consider 
the path in T connecting x, y, the path connecting x, z and the path connecting 
y, z. There is a unique vertex that is shared by these three paths, the median vertex 
of S in T, denoted medT(<5'). 

Theorem 1.1. Let X he a finite set, and suppose that '(g C (^), anrf U*^ = ^• 
The following are equivalent: 

(1) There exists a tree T — (V, E) with X (~V for which the function S > mcdT(»5) 
from to V is injective. 

(2) There exists a tree T = (V, E) with X as its set of leaves, and all its other 
vertices of degree 3, for which the function S <—>■ medT(<S') from "rf to the set 
of interior vertices of T is injective. 

(3) satisfies the following property. For all non-empty subsets of we 
have: 

(1) ||J^'|>KI + 2. 

In order to estabUsh Theorem ll.il we first require a lemma. 
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Recall, from , that a collection V of subsets of a set M forms a patchwork if 
it satisfies the following property: 

A,B eV and Ar\B ^{l) => Ar\B,A\J B <eV. 

Lemma 1.2. Let X be a finite set, and suppose that C (^) , and U ^ = X. If 
^ satisfies the condition described in Part (3) of Theorem ] 1.1\ then the collection 
V of non-empty subsets of ^ that satisfy \ 1J^'| = \'^'\ + 2 forms a patchwork. 

Proof: Suppose ''^i,'^2 C and that "^i n ^2 7^ 0- Consider 

if := ||JCrin^2)| + lU(^iU^2)|. 

By ([T]) we have: 

(2) K > d"^! n -Tal + 2) + (l-^i U ^2| + 2) = + |^^2| + 4, 
and we also have: 

(3) K < KlJ^i) n (U^2)| + KU*^!) u (U^2)| = lU^il + lU^2|. 

Notice that the right-hand term in ([2]) and in ([3]) are equal, since | IJ = |^i| + 2 
as G "P for i = 1,2, and thus, the inequality in ([2]) is an equality, and so 
I UC^i n ^2)1 = l-Ti n -^2! + 2 and I U(^i U -^2)1 = l"^! U ^2! + 2, as required. □ 

Proof of Theorem The implication (2) => (1) is trivial. For the reverse 
implication suppose that T satisfies the property described in (2). First delete 
from T any vertices and edges that are not on a path between two vertices in X. 
Next attach to every interior (non-leaf) vertex v £ X a new edge for which the 
adjacent new leaf is assigned the label x, and henceforth do not regard v as an 
element of X. Next replace each maximal path of degree 2 vertices by a single 
edge. Finally replace each vertex v of degree d > 3 by an arbitrary tree that has 
d leaves that we identify with the neighboring vertices of v and whose remaining 
vertices have degree 3. These four processes result in a tree T' that has X as its 
set of leaves, and which has all its remaining vertices of degree 3 (i.e. a 'binary 
phylogenetic X-tree' [3]) and for which the median vertices of the elements of ^ 
remain distinct. Thus (1) and (2) are equivalent. 

Next we show that (2) ^ (3). Suppose T satisfies the condition (2) and that 
is a non-empty subset of Consider the minimal subtree of T that connects 
the leaves in IJ*^'. This tree has at least j*^'! interior vertices that are of degree 
3. However by a simple counting argument, any tree that has k interior vertices of 
degree 3 must have at least k + 2 leaves, and so ^ holds. 

The remainder of the proof is devoted to establishing that (3) (2). We use 
induction on n := \X\. The result clearly holds for n = 3, so suppose it holds 
whenever \X\ < n, n > 4 and that X is a set of size n. For x £ X, let n<^{x) be the 
number of triples in that contain x. If there exists x € X with n<^(a;) = 1, then 
select the unique triple in containing x, say {a, b, x} and let X' = X ~ {x}, ^' = 
'T^—{{a, b,x}}. Then IJ = X' and satisfies condition ^ and so, by induction, 
there is a tree T' with leaf set X' for which the median vertices of elements in 
are all distinct vertices of T'. Let T be the tree obtained from T' by subdividing 
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one of the edges in the path in T' connecting a and 6 and making the newly-created 
vertex of degree 2 adjacent to x by a new edge. Then T satisfies the requirements 
of Theorem ll.lf 2). and thereby establishes the induction step in this case. 

Thus we may suppose that n<i^(a;) > 1 holds for all x ^ X . In this case we claim 
that there exists x & X with nc^{x) = 2. Let us count the set 17 :— {{x,S) : x G 
S G 'rf} in two different ways. We have 



where the latter inequality follows from Inequality ^ applied to = Com- 
bining Q and ^ gives 2k + 3{n — k) < 3n — 6, and so fc > 6. Thus, since fc > 0, 
there exists x ^ X with n<g'(x) = 2, as claimed. 

For any such x X with n<^(a;) = 2, let {a, 6, x}, {a' , b' , x} be the two elements 
of containing x. Without loss of generality there are two cases: 

(i) a = a' ,b ^ b'; or 

(ii) {a,5}n{a',6'} = 0. 

In case (i) let 

X' ■.= X- {x}, 'g' ■= - {{a, 6, a;}, {a, 6', x}}, ;= U {{a, 5, 6'}}. 

Note that \]^\ — X' . Suppose that fails to satisfy the condition described 
in Part (3) of Theorem 11.11 Then there is a subset of "^i that violates Inequality 
(H]) of the form 'i^^ U {{a, 5, 6'}} where a, 5, 6' G U*^^ and C <^'. But in that 
case U {{a, 6, x}, {a, 6', x}} would violate Inequahty ([1]), which is impossible 
since Inequality ([T]) applies to this set, being a non-empty subset of Thus, 
satisfies Part (3) of Theorem ll.il Since IJ = X', which has one less element than 
X, the inductive hypothesis furnishes a tree T' with leaf set X' that satisfies the 
requirements of Theorem [TTTJ2). Now consider the edge of T' that is incident with 
leaf b' . Subdivide this edge and make the newly created midpoint vertex adjacent 
to a leaf labelled x. This gives a tree T that has X as its set of leaves, and with all 
its interior vertices of degree 3; moreover the medians of the elements of are all 
distinct (note that the median of {x, a, 6'} is the newly-created vertex adjacent to 
x, while the median of {x, a, 6} corresponds to the median vertex of {a, b, b'} in T' 
and therefore is a different vertex in T to any other median vertex of an element of 



(4) 




xex 




2}|. On the other hand, 
\n\ ^3\'g\ <3(n-2), 



In case (ii), let 



X' ■.= X - {x}, := <^ - {{a, b, x}, {a', b' , x 



and 

-^^1 :='^'U{{a,a',5}},'^^2 := U {{a, a', 6'}}. 
Note that |J = IJ = X'. We wiU estabhsh the following 
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Claim: One or both of ^\ or '^2 satisfies the condition described in Part (3) of 
Theorem 11.11 

Suppose to the contrary that both sets fail the condition described in Theorem ll.ir S). 
Then there is a subset of ^\ that violates Inequality (H]) , and it must be of the form 

U {{a, a', 6}} where C , a,a',b G \J'^^ and b' ^ \]'€^ (the last claim 
is justified by the observation that if h' G then U {{a, 6, a;}, {a', 5', a;}} 

would violate the condition described in Part (3) of Theorem ll.l[) . Similarly a 
subset of that violates Inequality ([T]) is of the form U {{a, a', 6'}} where 

c <^', a, a', 6' G U '^#'2 and 6 ^ U '^2. Now, let V be the subset of defined in 
the statement of Lemma 11.21 Then the sets 

:= U {{x,a,6},{x,a',6'}}; and := y jj^, „^ ^ j^, ^/jj 

are both elements of V and they have non-empty intersection, since they both con- 
tain {cc, a, 6} (indeed they also share {x, a', 6'}). Thus, Lemma [L2l ensures that ^\C\ 
^1 is also an element of V . However ^\ r\^2 is of the form '^'^u{{x, a, 6}, {a;, a', 6'}} 
where C and neither x, nor 6, nor h' are elements of IJ'^'^ because by our 
choice of x, x only occurs in the two triples {x,a,b} and {x, a',6'}, and because 
&' ^ U "^^ and 6 ^ U "^^^ Since is a subset of "^^ satisfies Inequality (P) which 
implies that ([1]) must be a strict inequality for H , contradicting our assertion 
that n'i^2 G T'. This justifies our claim that either or satisfies the part (3) 
of Theorem O 

We may suppose then, without loss of generality, that satisfies part (3) of 
Theorem ll.il Since yj^^i — X', which has one less element than X, the inductive 
hypothesis furnishes a tree T' with leaf set X' that satisfies the requirements of 
Theorem Il.lf 2'). Now, consider the edge of T' that is incident with leaf a'. Sub- 
divide this edge and make the newly created midpoint vertex adjacent to a leaf 
labelled 2: by a new edge. This gives a tree T that has X as its set of leaves, and 
with all vertices of degree 3; moreover, regardless of where b' attaches in T the 
medians of the elements of '^^ are all distinct (note that the median of {x, a' , b'} is 
the newly-created vertex adjacent to x, while the median of {x, a, b} corresponds 
to the median vertex of {a, a', 6} in T' and therefore is a different vertex in T to 
any other median vertex of an element of '^). This completes the proof. □ 

2. An extension 

For a subset F of X of size at least 3, and a tree T — (V, E), with X CV, let 

medT(r) := {mcdT(5) : S* C F, IS*] = 3}. 

Thus, medriY) is a subset of the vertices of T, moreover if X is the set of leaves 
of T, then nicdT{Y) is a subset of the interior vertices of T. 

Theorem 2.1. Let X be a finite set, and suppose that is a collection of subsets 
of X , each of size at least 3, and with [J"^ = X . The following are equivalent: 

(1) There exists a tree T — (V, E) with X as its set of leaves, and all its other 
vertices of degree 3, for which {medT{Y) : Y G is a partition of the set 
of interior vertices ofT. 
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(2) ^£ satisfies the following property. For all non-empty subsets of we 
have: 

(6) lU^'|-2> E (1^1-2). 

and this last inequality is an equality when = 

Proof: We first show that (1) ^ (2). Select a tree T satisfying the requirements 
of Part (1) of Theorem l2.1l For a nonempty subset of the minimal subtree T' 
of T connecting the leaves in IJ has k := | IJ "^'1 leaves, and fc — 2 vertices that are 
of degree 3. By the partitioning assumption, each element Y ^ generates — 2 
median vertices in T and these sets of median vertices are pairwise disjoint for 
different choices of y e Moreover, distinct interior vertices of T correspond to 
different degree 3 vertices in T' , and so the number of degree 3 vertices in T' can be 
no smaller than the sum of |F| — 2 over all F £ This establishes Inequality 
For the case where = note that T has [J'^ = X as its leaf set, and by 
the partitioning assumption, each of its \X\ — 2 interior vertices occurs in one set 
medT{Y) for some Y £'r^ and so \X\ — 2 < J2Ye'^'(\^\ ^ 2) which, combined with 
dS]), provides the desired equality. 

To show (2) (1) select for each set F G ^ a collection '^y of 3-element subsets 
of X of cardinality |y | — 2 for which IJ Cy = Y, and that satisfies that condition 
that for every nonempty subset of '^y we have IJ > l"^'] + 2; such a selection 
is straightforward - for example, if y ~ {yi, . . . , ym} then we can take 

(7) '^Y = {{yi,y2,y3},{yi,y2,y4}, ■ ■ ■ ,{yi,y2,ym}}- 

We first establish the following: 

Claim: := Uy^'^'i^y is a collection of 3-element subsets of X that satisfies 
Inequality ^ in Theorem ll.il 

To see this, suppose to the contrary that there exists a subset of "^^^ for which 
Inequahty ^ fails. Write = SiU S2U ■ ■ ■ L> Sk where 1 < fc < l*^!, and where 
is a nonempty set of 3-element subsets of X that are selected from the same set (let 
us call it Yi) from 'if (note that the fact that | Yi U 1^2 1 - 2 > \Yi\ - 2 + \Y2\ - 2 must 
hold for aUri,r2 in implies that jFi | -h jFal - l^i 0^2! -2 > lYil -2-^1^21 -2 and, 
hence, 2 > |Yi n I2I must hold for all yi,l2 in '^)- By our assumption regarding 
the set of triples we have | U^"l < l"^"! + 1 and so, if we let Wi := IJ S^ we 
have U = lJi=i and consequently 

fc fc 

(8) \[jW,\<Y.\^,\ + l. 

i=l 1=1 

For := {Fi, . . . , Yk} C ^ we have 

fc fc 

(9) ||J"^'| >^(|r,|-2) + 2 = ^|y,|-2fc + 2. 

i=l i=l 
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On the other hand, 

k k k k 

I U^'i < I U + E(i^» -w.,\) = \[jwA + - \w,\), 

i—1 i—1 i—1 i—1 

since Wi C Yi. By the condition imposed on the construction of we have 
\Wi\ > \Si\ + 2 for each i, and so, substituting this, and ([5]) into the previous 
inequahty gives: 

k k k k 

iU'^'i<Ei^^i + i+Ei^»i-E(i^''i + 2) = Ei>^.i-2fc+i, 

i—1 i—1 i—1 i—1 

which compared with ^ gives 1 > 2, a contradiction. This estabhshes that C* 
satisfies Inequahty ([1]) in Thcorem ll.il 

By Theorem 1 1.1 1 it now folfows that there is a tree T = [V^ E) with leaf set X for 
which the function S i— > medT('S') is injective from to the set of interior vertices 
of T. Now, for y G "if , we have 

(10) medT(>") = {medTlS*) : C F, |5| = 3} = {medT{S) : S e Cy}. 

The second equahty in (jlO[) requires some justification. Recalhng our particular 
choice of Cy from ([7]) , and noting that the medians of the triples in Cy are distinct 
vertices of T it follows that T\Y has the structure of a path connecting yi,y2 with 
each of the remaining leaves y d Y — {yi, 1/2} separated from this path by just one 
edge. Consequently if a vertex u of T is the median of three leaves in Y, then it 
is also the median of a triple {j/i, y2, y} for some y Y — {j/i, ^2}; that is, it is an 
element of {medrlS") : S € Cy}. 

Consequently, {medT{Y) : Y G "^j are disjoint subsets of the set of interior 
vertices of T. Moreover each interior vertex of T is covered by {medxiY) : Y G 
since the number of interior vertices is |X| — 2 and by assumption |X| — 2 — 
Sye<^(l^l ~ 2) = 1"^, |. This establishes the implication (2) (1) and thereby 
completes the proof. □ 
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